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CLUSTERING IN A STOCHASTIC MODEL OF 
ONE-DIMENSIONAL GAS 

By Vladislav V. Vysotsky 1 

St. Petersburg State University 

We give a quantitative analysis of clustering in a stochastic model 
of one-dimensional gas. At time zero, the gas consists of n identical 
particles that are randomly distributed on the real line and have zero 
initial speeds. Particles begin to move under the forces of mutual 
attraction. When particles collide, they stick together forming a new 
particle, called cluster, whose mass and speed are defined by the laws 
of conservation. 

We are interested in the asymptotic behavior of K n (t) as n — > oo, 
where K n (t) denotes the number of clusters at time t in the system 
with n initial particles. Our main result is a functional limit theorem 
for K n (t). Its proof is based on the discovered localization property 
of the aggregation process, which states that the behavior of each 
particle is essentially defined by the motion of neighbor particles. 

1. Introduction. 

1.1. Description of the model. We give a quantitative analysis of clus- 
tering in a stochastic model of one-dimensional gas. At time zero, the gas 
consists of n point particles, each one of mass These particles are ran- 
domly distributed on the real line and have zero initial speeds. Particles 
begin to move under the forces of mutual attraction. When two or more 
particles collide, they stick together forming a new particle, called cluster, 
whose mass and speed are defined by the laws of mass and momentum 
conservation. Between collisions, particles move according to the laws of 
Newtonian mechanics. 

We suppose that the force of mutual attraction does not depend on dis- 
tance and equals the product of masses. This assumption is natural for 
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one-dimensional models because, by the Gauss law applied to flux of the 
gravitational field, gravitation is proportional to the distance to the power 
one minus dimension of the space. At any moment, the acceleration of a 
particle is thus equal to difference of masses located to the right and to the 
left of the particle. 

Random initial positions of particles are usually described (see [8, 16, 
25]) by the following natural models: in the uniform model, n particles are 
independently and uniformly spread on [0, 1]; in the Poisson model, particles 
are located at points ^<Si, ■ ■ ■ , hS n , where Si is a standard exponential 
random walk. In other words, particles are located at points of first n jumps 
of a Poisson process with intensity n. 

These two models are the most natural and interesting; let us call them 
the main models of initial positions. However, we will see that behavior 
of the Poisson model is essentially defined by independence of initial dis- 
tances between particles rather than by the particular type of the distances' 
distribution. Therefore, it is of a great mathematical interest to general- 
ize the Poisson model by introducing the i.d. model, where "i.d." stands 
for "independent distances," as follows. Particles are initially located at 
^Si, ^£2, • • • , n^ni where Si is a positive random walk whose nonnegative 
i.i.d. increments Xj satisfy the normalization condition EAj = 1. Note that 
if we proceed to the limit as n — > 00, we consider a system of total mass one, 
which consists of, roughly speaking, infinitesimal particles homogeneously 
spread on [0, 1]; this is true for all the mentioned models of initial positions. 

The mathematical interest in sticky particles systems arises mainly from 
relations between these systems and some nonlinear partial differential equa- 
tions originating from fluid mechanics, for example, the Burgers equation. 
These equations admit interpretation in terms of sticky particles; see Gur- 
batov et al. [10], Brenier and Grenier [4] or E, Rykov and Sinai [6]. Sticky 
particles models are also used for numerical solving of other partial differen- 
tial equations; see Chertock et al. [5] for explanations and further references. 

As time goes, particles aggregate in clusters. Clusters become larger and 
larger while the number of clusters decreases until they merge into a single 
cluster containing all initial particles. This process of mass aggregation is 
strongly connected with additive coalescence; see Bertoin [2] and Giraud [9] 
for the most recent results and references. 

The aggregation process resembles formation of a star from dispersed 
space dust and sticky particles models indeed have relations to astrophysics. 
It is appropriate to clarify these relations since they are not so direct and 
cause a lot of misunderstanding. 

It is known that the distribution of galaxies in the universe is very inho- 
mogeneous and the regions of high density form a peculiar cellular structure. 
The first attempt to understand the formation of such structures was made 
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in 1970 by Zeldovich. Most of the mass in the universe is believed to ex- 
ist in the form of particles that practically do not collide with each other 
and interact only gravitationally, for example, neutrinos. In his model, Zel- 
dovich considered an initially homogeneous collisionless medium of particles 
moving by pure inertia; the gravitational interaction was taken away by an 
appropriate time change. He showed that singularities, that is, the thin re- 
gions of very high density of particles, so called "pancakes," appear even if 
initial speeds of particles form a smooth velocity field. 

Zeldovich's approximate model, however, does not explain formation of 
the cellular structure of matter. His approximation does not take into ac- 
count that particles hitting a "pancake" are hampered by its strong gravita- 
tional field and start oscillating inside the "pancake" instead of flying away. 
Although this gravitational adhesion of collisionless particles is not precisely 
the same as the real sticking, the model of sticky particles serves as a reason- 
able approximation. The effect of gravitational adhesion was then analyzed 
by the use of the Burgers equation; Gurbatov, Saichev and Shandarin pro- 
posed it in 1984 to extend Zeldovich's approximation, which is invalid after 
formation of "pancakes." 

The model of sticky particles is directly mentioned in Gurbatov et al. [11]; 
a comprehensive survey of the formation of the Universe's large-scale struc- 
ture could be found in Shandarin and Zeldovich [23]. 

1.2. Statement of the problem and the results. In general, the problem 
is to describe the process of mass aggregation. How fast is it? How large 
the clusters are? Where do clusters appear most intensively, and so forth? 
Numerous papers on the model (e.g., [8, 14, 16, 20, 25]) are dedicated to 
probabilistic description of various properties of the aggregation process as 
the number of initial particles n tends to infinity. Thus, the behavior of a 
typical system consisting of a large number of particles is studied. 

In this paper, we are interested in the asymptotic behavior of K n (t), which 
denotes the number of clusters at time t in the system with n initial particles. 
This variable is a decreasing random step function satisfying K n (0) = n 
and K n {t) = 1 for t > T^ ast , where T^ ast denotes the moment of the last 
collision. While calculating K n (t), we also count initial particles that have 
not experienced any collisions; in other words, K n {t) is the total number of 
particles existing at time t. 

It is very important to know the behavior of K n (t). This gives us a deep 
understanding of the aggregation process since the average size of a cluster 
at time t is 

At first we give a short deterministic example. Suppose that particles are 
located at points ~j • • • j that is, Si = i. By simple calculations, we find 
that there would not be any collisions before t = 1. At the moment t = l, all 
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particles simultaneously stick together, hence K n (t) = n for < t < 1 and 
K n (t) = 1 for t>l. 

However, when the initial positions are random, the aggregation process 
behaves entirely differently. In [25], the author proved the following state- 
ment. 



Fact 1. There exists a deterministic function a(t) such that both in the 
Poisson and the uniform models of initial positions, for any t>0, we have 

n\ K n (t) p 

(1) ► aft), n — > oo. 

n 

The function a(t) is continuous, a(0) = 1, and a{t) = for t > 1. We conjec- 
ture, on the basis of numerical simulations, that a(t) = 1 — t 2 for < t < 1. 

The relation a(t) = for t > 1 is not of a surprise because we know from 

Giraud [8] that both in the Poisson and the uniform models, T„ — ► 1 (the 
limit constant is so "fine" due to the proper scaling of the model). Therefore, 
we say that the moment t = 1 is critical; note that this moment coincides 
with the moment of the total collision in the deterministic model. 

The aim of this paper is to strengthen the result of [25]. We first gen- 
eralize Fact 1 and prove it for the i.d. model. We will see [relations (19) 
and (27) below] that a(t) is equal to the probability of a certain event that 
is expressed in terms of X{. Also, we will prove that a(t) depends on the 
common distribution of X{ as follows: a(t) = 1 on [0, y/Ji), where 

Li:=sup{y.F{X l <y} = 0}; 

a(t) € (0, 1) on (y/JI, 1); and a(t) = on (1, oo). 

Furthermore, the recent results of the author [26] allow us to prove the 
conjecture from Fact 1 that a Poiss (t) = a Unif (t) = 1 - 1 2 for < t < 1. There 
is an amazing contrast between the simplicity of this formula and the hard 
calculations one needs to obtain it. It is remarkable that now we know the 
limit function a(t) for the main models of initial positions. 

Our main goal is to improve (1) by finding the next term in the asymp- 
totics of K n (t). The result is the following statement, where the standard 

symbol — ► denotes weak convergence and D denotes the Skorohod space. 

Theorem 1. In the i.d. model with continuous X{ satisfying ^Xj < oo 
for some 7 > 4, there exists a centered Gaussian process K(-) on [0, 1) such 
that 

K n {-)-na{-) w.x in D[0,1 - e] for all ee (0,1) 
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as n — > oo. The process K(-) depends on the distribution o/Xj. This pro- 
cess satisfies K(0) = and has a.s. continuous trajectories. The covariance 
function R(s,t) of K(-) is continuous on [0, l) 2 , R(s,t) > on (^Ji,l) 2 , and 
R(s,t) = on [0,1) 2 \(vfol) 2 . 

In the uniform model, (2) holds for some centered Gaussian process (■) 
on [0, 1). This process satisfies ET Umf (0) = and has a.s. continuous trajecto- 
ries. The covariance function i? Umf (s,t) o/i^ Umf (-) is continuous on [0, l) 2 , 



and R Vnii (s,t) = R Poiss (s,t) - s 2 t 2 . 

Thus, the Poisson and the uniform models lead to different limit processes 
K Poiss (-) and £T Unif (-), although a Poiss (-) = a Unif (•). 

As an immediate corollary of Theorem 1 (see Billingsley [3], Section 15), 
we get 

(3) Kn{t) - na{t) ^M(0,a 2 (t)), rwoc 

for any t < 1, where o~ 2 (t) := R(t,t). It is possible to show that in the i.d. 
model, (3) holds for all t ^ 1 under the less restrictive condition EX 2 < oo, 
with o~ 2 (t) = for t > 1; continuity of Aj is not required. 

We also study convergence of the left-hand side of (3) at the critical 
moment t = l. Apparently, the limit is not Gaussian, but this complicated 
problem is related to a curious, but hardly provable conjecture on integrated 
random walks. In view of this non-Gaussianity, it seems impossible to prove 
any extended version of Theorem 1 that describes the weak convergence of 
trajectories on the whole interval [0,1]; we refer to Section 7 for further 
discussion. 

We finish this subsection with a note on scaling. In our model, the masses 
of particles are equal to ^ and the distances between them are of the order 
— . Let us rescale the i.d. model by multiplying all masses and distances 
by n: the system of particles of mass one each, initially located at points 
Si ~ S[ n /2],S2 — S\ n j2\ , ■ ■ ■ , S n — S\ n /2], is called the expanding model. The 
particles are shifted by S , [ n /2] because we want the system to expand "filling" 
the whole line as n — > oo rather than only the positive half-line. 

All results of our paper hold true for the expanding model. This is not 
unexpected because the shift does not produce any changes and the rescaling 
of masses is equivalent to the time contraction by n times while the rescaling 
of distances is equivalent to the time expansion by n times. We refer the 
reader to Section 2 below or to Lifshits and Shi [16] for rigorous arguments. 



1.3. Organization of the paper. In Section 2 we describe a general method 
which is used to study systems of sticky particles. This method is applied for 
studying the i.d. model in Section 3, where we investigate some properties of 
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the aggregation process. We will show that the aggregation process is highly 
local, that is, the behavior of a particle is essentially defined by the motion 
of neighbor particles. This localization property suggests that we could use 
limit theorems for weakly dependent variables to prove both Fact 1 and 
Theorem 1 for the i.d. model; this will be done in Section 4. Then we will 
prove Theorem 1 for the uniform model in Section 5. In Section 6 we study 
the number of clusters at the critical moment t = 1. Some open questions 
are discussed in Section 7. 

2. Method of barycenters. In this section we briefly describe the method 
of barycenters, which is the main tool used to study systems of sticky par- 
ticles; it is also applicable to more general models where particles could 
have nonzero initial speeds and different masses. The method of barycenters 
was independently introduced by E, Rykov and Sinai [6] and Martin and 
Piasecki [20]. 

Let us start with several definitions. We always numerate particles from 
left to right and identify particles with their numbers. A block of particles is 
a nonempty set J C [l,n] consisting of consecutive numbers. For example, 
the block (i,i + k] consists of particles i + . . . ,i + k. Note that there are not 
any relations between blocks and clusters: for example, a block's particles 
could be contained in different clusters and these clusters could even contain 
particles that do not belong to the block. 

It is convenient to assume that initial particles do not vanish at collisions 
but continue to exist in created clusters. Then the coordinate x^ n {t) of a 
particle i could be defined as the coordinate of a cluster that contains the 
particle at time t. The second subscript n always indicates the number of 
initial particles; we will omit this subscript as often as possible. 

By xj(t) := J2ieJ x i(t) denote the position of the barycenter of a 
block J at time t. Further, define 



where Mj := n _1 (n — maxjgj) and Mj := n (muxjgj — 1) are the to- 
tal masses of particles located to the right and to the left of the block J, 
respectively. 

A block is free from the right up to time t if, up to this time, the block's 
particles did not collide with particles initially located to the right of the 
block. We similarly define blocks that are free from the left and say that a 
block is free up to time t if it is both free from the right and from the left. 

The next statement plays the key role in the analysis of sticky particles 
systems. The barycenter of a free block moves as an imaginary particle con- 
sisting of all particles of the block put together at the initial barycenter. In a 
more precise and general way, we state the following. 



xH*):=zj(0) + |(M 



(R) 
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Proposition 1. If a block J is free from the right (resp. left) up to time 
t, then xj(s) > Xj(s) for s £ [0,t] [resp. xj(s) < x*j(s)]. If a block J is free 
up to time t, then xj(s) = x*j(s) for s £ [0,t]. 

This statement could be found, for example, in Lifshits and Shi [16], 
Proposition 4.1. The easy proof is based on the property of conservation of 
momentum. 

The moment when a particle j sticks with its right-hand side neighbor 
j + 1 is called the merging time T^ n of the particle j. In other words, Tj tU 
is the first moment when particles j and j + 1 are contained in a common 
cluster; here j £ [l,n— 1]. Proposition 4.3 from Lifshits and Shi [16], which 
is stated below, gives us a way to calculate Tj,n- 

PROPOSITION 2. For every j £ [l,n— 1], we have 

(4) T hn = .mini S > : x* m (s) = x* {lJ] (s)}. 

0<l<j 

Thus, Tj n is expressed by means of barycenters. Note that since 

k — I 

( 5 ) x *j,k] ( s ) - x *(i,j] ( s ) = x (j,k] (0) - X(z j] (0) - ~^-s 2 , 

each of the equations x% k As) = x%^(s) has a unique nonnegative solution. 
We also mention that at the moment T^ n appears a cluster that consists of 
the particles I + 1, . . . , k, where k and I are minimizers of the right-hand side 
of (4). 

We will prove Proposition 2 since the proof is simple and perfectly illus- 
trates the sense of the method of barycenters. 

PROOF of Proposition 2. For any u < Tj >n , the particles j and j + 1 
are contained in different clusters. Therefore, for every I <j, the block [l,j] 
is free from the right up to time u, and for every k > j, the block [j + 1, k] 
is free from the left. By Proposition 1, 

X*lJ](u) < XQ d] (u) < Xj( U ) < x j + 1 (u) 

< x {jM (u) <x* {jtk] (u), 

and since, by (5), the function Xq k ,(s) — ^(s) is decreasing for s > 0, we 
conclude that 

u<{s>0:x* ijM (s)=x* {lJ] (s)}. 

Taking minimum over k, I and taking supremum over u, we get Tj n < 
min{- • •}. 
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Let us prove the last inequality in the other direction. By the definition of 
Tj >n , there exist an I < j and a k > j such that the blocks and (j, k] are 
free up to time Tj n (clusters containing particles from these blocks collide 
exactly at time 7j-, n ). In view of Proposition 1, 

X *l,j] (. T j,n) = x {l,j]{Tj,n) = x {j,k]{Tj,n) = x (j,k]( T j,n)i 

hence Tj jU = {s > : x*^ (s) = x*^ (s)} and T^ n > min{- • •}. □ 

3. Study of the i.d. model. The localization property. At first, note that 

n-1 

(6) K n (t) = l + J2Ht< Tl , n} 

because the total number of clusters decreases by one at each moment Tj )n . 
This representation plays the key role in the investigation of K n {t). Clearly, 
we need to study properties of the r.v.'s T i n to prove limit theorems for 
K n (t); such study will be done in this section. 

3.1. The initial study. Let us simplify the representation for Tj n from 
Proposition 2. In this section we consider the i.d. model of initial positions, 
where Xj :Tl (0) = -Sj. Recall that Sj is a random walk with i.i.d. increments 
{Xj}j £ z (we will need the variables {Xj}j<Q later). 

Rewrite the initial distance between barycenters as 

x(j,k](°) ~ x (i,j](°) 

1 k 1 1 J 1 

- E -Si-^-j E 



k — j .^n j — l.^rf.n 
J i=j+i J 1=1+1 

= \ ( ^~ E (St - S j+1 ) + £ (S 3 - ft) + (S j+1 - Sj)) 

l / 1 fc-i-i 1 j-i-i > 

= n\k~^] ? ( S i+ i+1 ~ S i +1 "> + ~] E (Sj - Sj-i) + Xj +1 

let us agree that ^0 := 0. Further, by 

x (j,k}(°) ~ x Q,j](°) 

1 / 1 k-j-l j+i+l -, j'-i-l j \ 

= Uk=j E E + E E 

\ i=l rn=j+2 i=l m=j—i+l / 

1 / 1 

= n{]—] g ^-i-W+i+l 
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•> 1 i=l / 



and (5), we have 



n 

where 

F PJ,ff( s ) := - 22(P - j ) X 3+i+l 
P i=l 

" i=i 

(for p,q>l and j G Z). Now, by Proposition 2, we get 

Tj n = min {s > : F k _ j:j j _i(s) = 0} 
j<fe<n 

0<Z<j 

(8) = min {s>0:F k>jil (s) = 0}. 

l<k<n—j 
l<l<j 

Note that F pj)5 (0) > for all p,j,q and F p j tq (s) is decreasing for s > 0. 
This function could be also written in the more convenient form: 

l p_1 

f p,jA s ) = - £(p - OC^j+i+i - fi2 ) 

(9) ? _ i 

* i=i 

3.2. Localization property of the aggregation process. We see that T^ n 
is a function of X2, ...,X n ; in other words, it is necessary to know the 
distances between all n particles to find Tj tH . The aggregation process is 
actually highly local, that is, the value of Tj n is essentially defined by the 
initial distances between neighbor particles {i} of j for which \ j — i\ is small 
enough. 

To make this statement rigorous, we need to introduce the following no- 
tation. Let us put 

T\ M) := min {s > :F kJ>l (s) = 0}, j G Z, Af G N, 

which is expressed in terms of the variables {Xi}y_^ <M only. Also, define 
Tj := inf {s > : F kjj>l (s) =0}, j G Z, 

K,(>1 
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which is, in some sense, the merging time in an appropriate infinite system 
of particles. The reader could construct such system by considering the limit 
of the expanding model, see Section 1. 
It is clear that 

(10) Tj < Tj tTl < T\ 3 An ~ j) , j, n G N, j < n, 

where by A and V we denote minimum and maximum, respectively, and 

(11) Tj<t!> m) , jeZ,Af€N. 

Let us estimate the rate of the convergence of ¥{Tj ^ Tj M ^} to zero as the 
"radius of the neighborhood" M tends to infinity. We thus could "measure" 
the above-mentioned locality of the aggregation process. In fact, by (10), we 
have P{T j; „ / t\ M) } < F{Tj ^ rj M) } for any n G N, j < n, and M < j An- j. 



Lemma 1. Suppose KXj < oo for some 7 > 1. Then there exists a non- 
decreasing function p(t) such that 

(12) max(P{i {t < T . } + l {t ^ M)} }MT, + rf 7) ,rj M) < t}) < p(t)M 1 ^ 

for any t G (0, 1), j € Z, and M G N. Moreover, for any t < 1, the left-hand 
side of (12) is o(M 1 "T). 



Proof. Let us estimate the first probability in the left-hand side of 

>ns ( 

Mi 



(12). By properties of Fk,j,l(') an d definitions of Tj and of Tj 



f{Ht<T j} + \ t < T W } } = HTj < t < Tf>} 

= p{ inf F kJi i(t) < 0, min F kJ>l (t) > j. 
{k,l>l ' J ' l<k,l<M ,J ' J 

By (9), this expression does not depend on j, and putting j := — 1, 



3 

fc-1 



l> — 1=1 

1 l ~ l 

— i=l 



fc-1 

l<k<M k 



i=l 
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1 l ~ l 1 
+ i min f j £(Z - - t 2 ) + (X - t 2 ) > . 

i=l ) 

We then compare the inequalities in the braces and obtain 

F{l( 1 <T J )#l„ STj (.u, ) } 

£ *(« i £(* - o w - * 2 ) < l3 i? M ; E(* - - * 2 )} 

k 1=1 1=1 ) 

= A j* I X> - « 2 ) < , mm i g(5, - «»)} 

{-, fc-1 fc— 1 "J 

inf T Y(Si -it 2 ) < min - V (Si - it 2 ) I . 
k>M k k£{l,M} k J 

Now rewrite the event in the last line as 

ja* > M : I Y;(Si - it 2 ) < min ^0, ^ ^(S, - it 2 )^) | 
t i Af-l 

= |3fc>M:- £j (Si -i* 2 ) 

+ - ^ (S, - it 2 )< min 0,- YiSi-it 2 )) . 

i=M \ i=l / J 

Analyzing both cases < ^ E^lH^i - and > jj E^lH^i - ^ 2 )> w e 
conclude that the considered event implies 

hk>M:jY. {Si-it 2 )<o\ = (a&>M: ^ (Si - it 2 ) <o|. 
Clearly, the latter implies 

{3i > M:Si - it 2 <0} = ( inf — <t 2 l; 

[i>M I J 

hence, combining all the estimates together, we get 

(13) nt {t <T j} + l {4 <^, } } < 2p{inf | < t 2 }. 

Note that we obtained (13) without any assumptions on the moments of JQ. 

We now estimate the right-hand side of (13); recall that EXj = 1. Then 
the first part of (12) immediately follows from the classical result of Baum 
and Katz [1] (see their Theorem 3 and Lemma): 
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Fact 2. IfEXi = a and E|Xj| 7 < oo for some J>1, then 



< sup 


s l 

a 


>e} = o(k l -i), 


k- 


-> oo 


U>fc 


i 









for any e > 0. 7n addition, the series YlkLi IP{sup i > fc |-4 — a| > e} converges 
for all £> if j = 2. 

The estimation of the second probability in the left-hand side of (12) is 
completely analogous, since 

{T^Tf\ T M<t} 

= {Tj < rj M) < t} 

We put j := — 1, repeat the estimates, and get 

p{Tj t^ M \t\ M) <t}< 2F{3i >M:Si- i[T<J°] 2 < 0, T^f ) < t} 

instead of (13). The right-hand side does not exceed 2P{3i > M : Si — it 2 < 
0}, hence 



(14) F{Tj / rj A/) , rj M) < t } < 2p| inf y < t 2 1 . 



□ 



3.3. The distribution function of To in the Poisson model. It is amazing 
that in the Poisson model, the distribution function of To could be found 
explicitly. This is important because by (27) below, the limit function a(t) 
equals P{To > t} for the i.d. model. Also, in the proof of Theorem 1 for the 
uniform model, we will need a Poiss (t) = PITq* 0188 > t} to be twice differen- 
tiable and have a continuous second derivative. 

Lemma 2. In the Poisson model, for < t < 1, we have 
(15) F{T >t} = l-t 2 . 

In addition, for t>0, n>2, and 1 < j < n — 1, we have 

nT h n >t} = e t2 F I mm ^(Si - it 2 ) > 0^ 

(16) 



Kk<j~ 
(. - - J i=l 

4 mm Y{Si-it 2 )>o\. 



where Si is a standard exponential random walk. 



CLUSTERING IN A STOCHASTIC MODEL OF ONE-DIMENSIONAL GAS 13 
PROOF. We start with (16). By (8), (9) and properties of F k jj(-), 



P{T i)n >t} = p{ mill F kJtl (t)>o\ 

ll<k<n-j J 



l<l<j 

(17) =p{ i< min ±V>- i)W+(+1 -f<) 

1 = 1 



j fe-1 



1 ' _1 1 

+ y E( z - - * 2 ) + x 3+i - 1 2 > o j • 

In the right-hand side of the last equality, by Y denote the first minimum 
and by Y denote the second one. 

Suppose A" is a standard exponential r.v., Z is a nonnegative r.v., and 
that X and Z are independent; then 

;>oo 

F{Z<X} = / F{Z<x}e~ x dx 
Jo 

= J Et{ Z < x ye~ x dx = E y l{ Z < x }e~ x dx = Ee~ z . 

Hence in view of independence of Y, Y, Xj + i we get 

F{Y + Y + X j+1 -t 2 >0} = Ee Y+t ~ t2 = e^Ee^Ee 9 ^; 
and therefore, 

H T j,n >t} = e f2 F{Y + X j+1 - t 2 > 0} • F{Y + X j+1 -t 2 > 0}. 
Now, by 

F{Y + X j+ i - t 2 > 0} 



(18) 



mm 

i<l<j ' 



mm 1 £(I - ~ **) + " t 2 > 

Z i=i J 

£(J - i)(X i+l - t 2 ) + /(X! - t 2 )j > j 

= p|mmXa-^ + l)(^-* 2 )>o|, 

we conclude the proof of (16). Indeed, the expression in the last line equals 
the first probability in the right-hand side of (16). 
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Now let us prove (15). From the definition of T and 2q we see that 
l| t<T Wj — > l{t<r } a - s - as k — > oo; then by (16), 

P{T > t} = e*V j inf ^(Si - ^ 2 ) > o|. 
Then we need to check that 



»|ig^(Si-«)>o| = vT 



for < i < 1. The complicated calculations of this probability take more 
then ten pages. Therefore, they were separated into independent paper [26]. 
Although these calculations seem to be technical, they are based on quite 
original ideas. □ 

3.4. Some properties of the variables Tj. In this subsection we prove 
several important properties of the r.v.'s Tj. 

1. The sequence Tj is stationary. 

Proof. This statement immediately follows from the definition of 2$ and 
stationarity of Xi, which are i.i.d. 

2. The common distribution function of Tj is defined by 

{k— 1 
- i=i 

(19) 

+ £f y E( Z - - **) + (^o - * 2 ) > o} . 

— «=i ) 

Proof. This formula follows from (9). 

3. We have P{^//7 < T t < 1} = 1 while sup{y :P{T; < y} = 0} = y^Z and 
inf{y :F{Ti < y} = 1} = 1; recall that // = sup{y :P{X; < y} = 0}. In addi- 
tion, if < BXi < oo, then P{T; = 1} = 0. 

Proof. First, P{y7* < ^} = 1 is trivial, because both infima in (19) are 
nonpositive. 

Second, fix a t > 1 and consider P{Tj > t}. Taking into account that infima 
in (19) are nonpositive, we obtain 

fc-i 



P{T< >t}< pjinf I J2(k - i)(Xi - t 2 ) + (X - t 2 ) > j. 
Then by the same arguments as in (18), 

P{?i > *} < P j inf J2( k ~i + l)(Xi- t 2 ) > j = p| inf J^(5< - it 2 ) > o|. 
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By the strong law of large numbers, this probability is zero for all t > 1. 
If t = 1 and < BXi < oo, then 

p{ inf V (S^ - i) > ol = lim p{ min V (S, - i) > j 

1 -A Si — % 



= lim P<^ min - V / > }, 

n^oo ^ i< fc <„ n -y/nBXi ~ J 

and from the invariance principle, we get 

P{7i > 1} < p( min / W(u)du>o\. 

[0<s<lJo ) 

It follows from the asymptotics of unilateral small deviation probabilities of 
an integrated Wiener process, see (43) and (44) below, that the last expres- 
sion equals zero. 

Third, sup{y : P{T; < y} = 0} = y/Ji and inf {y : P{T, < y} = 1} = 1 follow 
if we prove that for any t < EJQ = 1, the common distribution of the i.i.d. 
infima in (19) has an atom at zero. But we have 

r i fc-i ~> f i fe-i ~i 



mf-E(^-^-t 2 )=0^Pjmf-E(^-^)=0 

Si 



i=i ) \ i=\ 



> P<j inf — > t 2 
[i>i i 

and it could be shown via the strong law of large numbers that the last 
probability is strictly positive for all t < 1. 

4. Suppose Xi is continuous. Then Tj and Tj n are continuous for any 
j,k,n and the common distribution of Tj could have an atom only at 1. In 
addition, if EX? < oo, then Tj are continuous. 

Proof. By (7) and (8), 



(20) T jtn = min JH(k,j,l), 

l<k<n—j v 
1<1<3 



where 



2 . I 9 " 1 



H(p,j, q) ■■= — — [-J2(P- + - S(? " *)*i-<+i + ) • 

" \ i=l i=l / 

Hence Tj n is continuous as a minimum of a finite number of continuous 
r.v.'s. The T- are also continuous because T- fc ^ =T^^k- 
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Now we prove the continuity of Tj. By Property 3, it only remains to 
verify that F{Tj > t} is continuous on [0, 1). But P{rj fc) >t}- F{Tj >t} = 
P{l{t<T.,} / t {t<T ( k)} }, and in view of (13), 

sup |P{rJ fc) > t}-P{T 7 - > t}\ < sup 2P( inf — < t 2 ) = 2P( inf — < s 2 } 

0<t<s J 0<t<s lm>k m J [m>k TXl J 

for every s < 1 = EX; . The last expression tends to zero by the strong law 
of large numbers; then ¥{Tj > t} is continuous on [0, s] as a uniform limit 

of continuous functions PjT^ > t}. Since s < 1 is arbitrary, ¥{Tj > t} is 
continuous on [0,1). 

5. The cov(l| s<Ti) }, l{ t< T k }) tends to zero as k — > oo for all s,iG [0, 1). If, 
in addition, ^Xj < oo for some 7 > 1, then for any s,t £ [0, 1) and k £ N, 
we have 

(21) |cov(l {s < To} , l {t < Tfe} )| < 2T(p( S ) + p{t))k x -~«. 

Proof. The idea is to approximate 1{ s <t } and t{ t <T k } by 1, <T (/t/2), and 
1, (fc/2), , respectively; here by /c/2 we mean \k/2], where \x] = min{m £ 

i t — 1 k J 

Z : m > 2;}. Note that 1. (it/2)-, and 1 ( k / 2 ) x are independent because the 

first is a function of {Xi} i<k / 2 while the second is a function of {Xj} i>fc / 2 +i- 
We then have 

|cov(l {s < To} ,l {4 < Tfc} )| 

= |COV(1 {S < TO} , t{t<T k }) ~ CQV(1 { (fc /2) } , l {t < T (fc/2) } )l 
< |E(l {s < To }%*<T fc } " 1 {s <T (*/ 2 )} 1 {t<T^})l 

(22) + |E(l {s < To} - l {s < T ( fc /2) } )| + |E(l{ t <T fc} - l {t < r (*/*) } )l 
But 

¥ { 1 {s<T } 1 {t<T k } + 1 { s < T ( k /V} 1 {t<Tl k 'V}} 

< P{l {s <r } / 1 {s <t^/2) } U ! {4 < Tfc} / t {t ^ k/2)} }, 

therefore the result follows from Lemma 1. 

6. The r.v.'s {T,} ieZ , {?f } }i 6 z, and {T^Z 1 are associated; the author 
owes this observation to M. A. Lifshits. 
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Proof. Let us first recall the definition and some basic properties of as- 
sociated variables. R.v.'s £1, . . . ,£ m are associated if for any coordinate- wise 
nondecreasing functions /, g : M. m — > R, it is true that 

cov(/(£i,...,£ m ),3(£i,...,£ m )) > 

(assuming that the left-hand side is well defined). An infinite set of r.v.'s is 
associated if any finite subset of its variables is associated. 

The following sufficient conditions of association are well known; see [7]. 

(a) Independent variables are associated. 

(b) Coordinate-wise nondecreasing functions (of finite number of argu- 
ments) of associated r.v.'s are associated. 

(c) If the variables £1^, . . . , £, rri: k are associated for every k and (£i,fc, . . . , 

£,m,k) — ► (6, • • • , Cm) as k —> 00, then fi, . . . , £ m are associated. 

(d) If two sets of associated variables are independent, then the union of 
these sets is also associated. 

Then {Tj^}™^ 1 are associated for every n by (a), (b) and (20). Analo- 
gously, {T^}i & z are associated for every k. Finally, since — > Tj a.s. as 
k — ► 00 for every i, (c) ensures the association of {Tj}j g z. 

7. For any s, t £ R and k G Z, 

(23) cov(t {To < s} ,l {Tfc < t} )>0. 

Proof. This inequality follows from cov(l{r 0<s }, t{ Tk<t y) = cov(l| s<To |, 
l{t<T fe })> the association of To,Tfc and (b). 

8. If EA.7 < 00 for some 7 > 2, then the stationary sequence min{Tj,£} 
is strongly mixing for any t < 1 and its coefficients of strong mixing a{k) 
satisfy a(k) = o(k 2 ~ J ). 

Proof. Recall that stationary r.v.'s £j are strongly mixing if a(k) — > as 
— > 00, where a{k) are the coefficients of strong mixing defined as 

a{k):= sup |P(AB) - P(A)P(B)|; 

here := cr(£oi£-ii • • •) an d .T 7 /? := 0"(£fc,£fc+ij . . .) are the a-algebras of 
"past" and "future," respectively. It is readily seen that 

(24) a(k)< sup I COT(/(f ,f-i,...)>s(&,&+i. •••))!. 

0</,9<l 

where the supremum is taken over Borel functions f,g:M.°° — > [0,1]. 

Let us estimate a(k) in the same way we estimated the left-hand side 
of (21). Fix some Borel functions /, g : M°° — ► [0,1]. We approximate the 
variables from the "past" T At, T_ x At, T_ 2 At, . . . by T^ k/2) At, T^{ 2+1) At, 
rp(k/2+2) ^ ^ ^ respectively; and for the variables from the "future," we 
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use the analogous approximation. Now, f{T^ 2 ^ A t,T^/ 2 At,...) and 
g^j-,(k/2) ^ ^ j,{k/2+i) ^ ^ ^ are j nc j e p endent because the first is a function 
of {^«}j<fc/2 and the second is a function of {^Q}i>fe/2+i- We then argue in 
the same way as in (22) to get 

| cov(/(T A t, T_i At,.. .),(/(!* A t, T k+1 At,.. .))| 
< 2p| p (T_i At) / (T^ /2+l) At)} 

+ 2P| Q (T fe+i A t)^{T^ +i) At)} 

oo 

<4 £ P{(T At)/(T W At)}. 

i=k/2 

Now, by the formula of total probability, we have 

P{(T At)/(T W At)} 

= P{(T At) / (T W A t), T W > t} + P{(T At) / (T W A t), T W < t) 
< P{l { i<r o} / l {t < T W } ) + nn + T® , T (J) < t} 

and combining all the estimates together, by Lemma 1 (24) and arbitrariness 
of / and g, we get a{k) < 8J2Zk/2°( il ^) = oik 2 ""*) if 7 > 2. For 7 = 2, we 
get a(k) < 16X^/c/2^{ m fi>A/ i 1 < = using the same argument and 
applying (13), (14), and Fact 2 instead of Lemma 1. 

3.5. TTie last collision. We finish this section with a statement on the 
convergence of the moments of the last collision. 

Proposition 3. In the i.d. model, T^ ast 1 as n — > 00 i/EJT 4 2 < 00. 

This result is well known for the Poisson model; see Giraud [8]. 

Proof of Proposition 3. Let us first prove that P{T^ ast > t} —> as 
n — > 00 for all t > 1. Since Z^ ast = maxKjXn-i Tj n , we have 

n-l 

(25) P{T ? l ast > t} < ]T P{T i; „ > t}. 
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By taking into account that the minima in (17) are nonpositive and by 
arguing as in (18), 

HTj,n>t}<A min lf](k-i)(X j+i+1 -t 2 ) + X j+ x-t 2 >o\ 



= p{ min Y(k-i + l)(Xi - t 2 ) >oj 

<p{ min YtSi-it 2 ) >o). 
We claim that (without any assumptions on the moments of Xi) 

(26) nT h n>t}<A sup ^>i±^|; 

U>(t-l)/4tn 1 1 J 

recall that t > 1. Clearly, (26) follows if we check that 

{ min Y(Si - it 2 ) >o)c I sup 5 > 
[l<fc<n/2^ J U>(t-l)/4tn * 2 

Assume the converse; then, by the nonnegativity of Si, 

n/2 cn n/2 

0<J2(S i -it 2 ) = Y,(S i -it 2 )+ £ (^"^) 

j=l «=1 i=cn+l 

<Y:(Scn-it 2 )+ e (*-^-^ 2 

i=l i=cn+l 

where c := We estimate the last expression with 

_ (cn) 2 +2 (n/2) 2 - (cn) 2 i 2 - 1 c 2 2 1/4 - c 2 t 2 - 1 2 

cnbm i • < — n • n . 

cn 2 2 2 ~ 2 2 2 

It is simple to check that the right-hand side is negative, thus we have a 
contradiction. 

Then from (25), (26) and Fact 2 it follows that P{T^ ast > i] = 
Er=/ o{{cn)- 1 ) = o(l) for all t > 1. 

Now let us prove that P{T^ ast < t} — > as n —> oo for all i < 1. Since 
T^ ast = maxi<j< n _i Tj in , we estimate 



l<j<Vn-l 



20 V. V. VYSOTSKY 

In view of (10) and Lemma 1, the sum is 

remains to check that the first probability in the last line tends to zero. 
For a fixed n, all Ty^ 2 ^ are independent because each one is a function of 
{^i}|iV5-i|<Vn/2 ( to be Precise, of X jVE _^ /2+2 , . . . , X jy/K+VE/2 ). Thus, 

lx T { ^ /2) <t\ = pv^i{ T (vV 2 ) < t i < P v^-i{ T < n 
which tends to zero; indeed, P{Tq < t] < 1 by Property 3, Section 3.4. □ 



max 

i<j<V^- 



4. Proofs of Fact 1 and Theorem 1 for the i.d. model. Recall that the 
number of clusters K n (t) is given by (6). Our idea is to study J27=i ^-{t<T z } 

instead of J27=i ^-{t<Ti „}'■ We thus deal with a single sequence T\ and avoid 
considering the triangular array Tj n . 

Let us now prove Fact 1 for the i.d. model. We prove (1) for t ^ 1 without 
any additional assumptions on Xi; for t = 1, we require EX? < oo. The 
properties of the limit function a(t) were studied in Section 3.4, Properties 
3 and 4. 

Proof of Fact 1. We put 
(27) a(t):=F{T >t}. 
Let us first prove (1) for all t < 1. It is sufficient to check that 

m ^-vXm.^^o, —=0. 

Indeed, the stationary sequence ^ft<TA satisfies the law of large numbers by 
Property 5, Section 3.4, and the well-known result of S. N. Bernstein: 

Fact 3. The law of large numbers holds for r.u.'s if there exists a 
sequence r(k) — > such that cov(£i,£j) < r(\i — j\) for all i,j E N. 



By (6), 



Knit) 1 " 

2^ Ht<Ti} 



n n : 

i=i 



1 1 

i=l 



where we used (10) to get the nonnegativity of the right-hand side. Then 
(28) immediately follows from the Chebyshev inequality provided that the 
expectation of the right-hand side tends to zero. By using (10), we obtain 

-y n—1 n—1 

- ]T E(l {t<T . n} - t {t<m ) <-J2 (El (iAB _ i)} - El {t<Ti} ) 

i=l i=l 



n—1 

- ¥ {Ht<1\} + \ t<T (iAn-i) } }, 

i=l 
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which is -Z)r=i°(l) = ^ Lemma 1. To be very precise, Lemma 1 
deals with slightly different indicators, but we can estimate the considered 
probability by repeating the proof of Lemma 1 word for word (or just use 
Property 4, Section 3.4). 

We now check that (1) holds for all t > 1. Using (26) gives E — ^ = 

k T,?=i H T i,n > *} -> as n -> oo and o(t) = follows from the 

Chebyshev inequality. 

It remains to check that (1) holds for t = 1 if EA 2 < oo to conclude 
the proof. If DAj = 0, then the situation is deterministic, this case was 
described in Introduction. Here we always have K n (l) = 1 and (1) is true. 
If < H>Xi < oo, then by Property 3 from Section 3.4, we have a(l) = and 
P{To = 1} = 0; consequently, a(t) = P{To > t} is continuous at t = 1. Then 

(1) is true for t = 1 since < ^jp- < a(t) for any t G (0, 1) and 

a(t) ->a(l) =0 as i / 1. □ 

Now we prove Theorem 1 for the i.d. model. We think of D[0, 1] as of a 
separable metric space equipped with the Skorohod metric d, which induces 
the Skorohod topology. 

PROOF of Theorem 1. At first, we prove (2). In view of representation 
(6) for K n (t), relation (2) follows from the relation 



(29) sup 

0<t<l-£ 



Y n—1 y n 

i=i v i=i 



for all e G (0, 1) 



and the existence of a centered Gaussian process K(-) on [0, 1) such that 
( 3 °) ^{ilHt^-MtA^Ki-) inD[0,l- £ ] for all eG (0,1) 

~i=l ) 



11 



T) IP 

Indeed, if Y n — ► Y and d(Y n , Y£) — ► for some random elements Y n , Y^, Y 

of the separable metric space D[0, 1 — e], then Y^ — > Y; recall that d(Y n , YJQ < 
sup te[0)1 _ e] \Y n {t) -Y£(t)|. 

We start with (29). It is sufficient to prove that the expectation of the 
left-hand side tends to zero. Since the supremum of a sum does not exceed 
the sum of suprema, let us check that 

^ n—1 

(31) SU P \Mt<T in }-Mt<Ti}\^0 for alls G (0,1). 

V n i= i 0<t<l-e 

By (10), we have 

E sup |lr t<T } -lr t<T} |<E sup (1 ( (,a»-.) , - tfKTA ) 
0<i<l-e 0<t<l-e i ^ i ' 
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= P{T 4 / T^ n ~ l \ Ti < 1 - e} 

+ P{l{l- e <Ti} / !| 1 _ £ < T (iAn-i) } }, 

where the last equality was obtained via the formula of total probability. 
Combining the estimates together and using Lemma 1, 

^S E o^tJ 1{4<T - } " 1{ ^ }1 



1=1 v 1=1 



The last expression is 

0( n 3/2- 7 ) an d (31), which implies (29), follows. 
Now let us prove (30). As long as 

Un{t) := -^{E ht<T %} - na(t)| = - (1 - a(t))j, 

the [/„(•) is the empirical process of stationary r.v.'s Tj with the continuous 

common distribution function 1 — a(t). By K(-) = —K(-), (30) is equivalent 
to the existence of a centered Gaussian process K{-) on [0, 1) such that 

(32) U n {-)^K{-) inZ)[0,l-e] for all eG (0,1). 

We will use the following result from Lin and Lu [17], Section 12 on 
convergence of empirical processes. They attribute this statement to Q.-M. 
Shao, who published it in 1986, in Chinese. 

Fact 4. Let £j be a sequence of stationary strongly mixing r.v. J s dis- 
tributed on [0,1], and let F be the common distribution function of £j. 
Suppose F(x) = x on [0,1] (i.e., £j are uniformly distributed) and the co- 
efficients of strong mixing of the sequence F(£i) decrease as 0(k~( 2+ ^) 
as k — > oo for some 5 > 0. Then the empirical processes of £j weakly con- 
verge in D[0, 1] to a centered Gaussian process with the covariance function 

E Je z cov (%,<s}' %<*})• 

Remark. The limit Gaussian process is a.s. continuous on [0, 1]. Fact 4 
also holds true if F is an arbitrary continuous distribution function. 

The a.s. continuity of the limit process could be concluded by a compar- 
ison of the proof from Lin and Lu [17] with the proof of Theorem 22.1 from 
Billingsley [3]. The statements and the proofs of these theorems are identical, 



CLUSTERING IN A STOCHASTIC MODEL OF ONE-DIMENSIONAL GAS 23 



but Lin and Lu do not state the continuity while Billingsley does. Further, 
since -F(£i) is uniformly distributed on [0, 1] if F is continuous, Fact 4 holds 
true for every continuous F; see the proof of Theorem 22.1 by Billingsley [3] 
for explanations. 

Recall that we need to prove the convergence of the empirical process of 
Tj. It seems that the r.v.'s Tj are not strongly mixing; but min{Tj, 1 — e} are 
strongly mixing because of Property 8, Section 3.4. These variables are not 
continuous and so we need to fix them. Let us fix an s G (0, 1), and let oti 
be i.i.d. r.v.'s independent of all Tj and, say, uniformly distributed on [0, e]; 
we define T{ := min{T, 1 — e} + l{Ti>i-s} a i- 

The stationary variables Tj are distributed on [0, 1] , their common dis- 
tribution function G is continuous, and the coefficients of strong mixing 
of G(Tj) decrease as o(/c 2-7 ). The proof of the last statement is the same 
as the proof of Property 8 from Section 3.4. Indeed, approximate the vari- 
ables G(f ), G(f_i), . . . from the "past" by G(T {fc/2) ), G(f[\ /2+1) ), . . . where 

T^ 1 ' := mm{T^ m \ 1 — e} + 1, ( m) x a>f, use the analogous approximation 

i-j — £ i 

for the variables from the "future"; and then repeat word for word the ar- 
guments of the previous proof. 

Now, recalling that 7 > 4, we see that T satisfy the assumptions of Fact 4, 
with the only difference that their distribution is not uniform. By U n (-) 
denote the empirical process of T; clearly, U n (-) coincides with the empirical 
process U n (-) of T on [0, 1 — e]. By the remark to Fact 4, we conclude that 
first, 

(33) U n (-)^K(-) in£>[0,l], 

where K{-) is a centered Gaussian process with the covariance function 

R(s,t) ■=Y, cm ( 1 {f <s}^ 1 {f l <t}) 

igZ 

and, second, trajectories of K(-) are a.s. continuous on [0,1]. 

[There exists a simpler and more elegant proof of (33). Note that {Tj}j £ z 
are associated as coordinate-wise nondecreasing functions of associated r.v.'s 
{Ti,ai}i<=z, see (a), (b) and (d) from Property 6, Section 3.4. Then we can 
obtain (33) applying the result of Louhichi [18] on convergence of empirical 
processes of stationary associated r.v.'s instead of using Fact 4. This the- 
orem requires only cov(F(^o), F^)) = 0(k^^ + ^), which could be proved 
analogously to Property 5, Section 3.4. Thus we avoid the complicated esti- 
mations of the strong mixing coefficients, and the proof of (33) is becomes 
much simpler. The only problem is that this proof requires 7 > 5. 

We also note that the a.s. continuity of K(-) could be proved directly, 
without referring to the proof of Fact 4. The arguments should be the same 
as in the proof of the continuity of X Umf (-) in Section 5.] 



24 



V. V. VYSOTSKY 



Define 

(34) R(s,t) :=^cov(l {To < s} ,l {T .< t} ), 

which is, evidently, equal to R(s,t) on [0,1 — e] 2 . Since R(s,t) is positive 
definite and e > is arbitrary, the function R(s,t) is positive definite on 
[0, l) 2 . Therefore, by Lifshits [15], Section 4, there exists a centered Gaussian 
process K(-) on [0,1) with the covariance function R(s,t). The trajectories 

of K(-) are a.s. continuous on [0, 1) by K(-) = K(-) on [0, 1 — e], arbitrariness 
of e > 0, and the a.s. continuity of K{-) on [0, 1]. 

Finally, by (33), U n (-) = U n (-) on [0, 1 - e], K(-) = K(-) on [0, 1 - e], and 
the a.s. continuity of K(-), we get (32). Since (32) implies (30), we conclude 
the proof of (2). 

Only the stated properties of R{s,t) remain to be proven. The continuity 
of the joint distribution function of continuous variables To and Tj implies 
that cov(l| To<s }, t{Ti<t}) i s continuous on [0, l) 2 for every i > 0. Then, in 
view of (21), R(s, t) is continuous on [0, l) 2 as a sum of uniformly converging 
series of continuous functions. 

The strict positivity of R(s,t) on (y/Jl, l) 2 trivially follows from (34), (23) 
and cov(lr To<s i, lr To<t \ ) = a(sVf)(l — a(sAt)) > 0; the last inequality holds 
by Property 3, Section 3.4. The R(s,t) = on [0, l) 2 \ ( v ^t, l) 2 follows from 
^{Ti < y/JI} = 0, which holds by Properties 3 and 4 from Section 3.4. □ 

We note that (3) holds for 1 7^ 1 under the less restrictive condition EX 2 < 
00. For t < 1, the proof is almost the same: By (29), which is true for 7 > 3/2, 
we conclude that (3) holds if the stationary associated sequence l/^y.i 
satisfies the central limit theorem. Then we refer to the central limit theorem 
for stationary associated sequences from Newman [21]; his theorem requires 
only R(t, t) < 00, that is, the convergence of the right-hand side of (34). This 
condition holds by (13) and Fact 2. For t > 1, relation (3) holds true with 
a 2 {t) = because of Proposition 3. 

Finally, note that the process K{-) is associated, that is, the r.v.'s 
{if (t)} tg [ 0i i) are associated. In fact, by (6), Property 6 from Section 3.4, 
and Condition (b) from the same Property 6, the processes Kn ^')~^ a ^ are 
associated for every n. Then K(-) is associated by (2) and (c), Property 6. 

5. Proof of Theorem 1 for the uniform model. There exists a simple 
method that allows to extend results from the Poisson model to the uniform 
model and vise versa. The method is based on the next statement (see 
Karlin [13], Section 9.1). 
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Fact 5. Let Si be an exponential random walk. Then for any k>l, we 
have 

(35) ( TT^-j ■ ■ > 7T^) = ( u i,k,U 2 ,k, • • • , U kik ), 

\Dk+l Jk+l Jk+l/ 

where Ui t k are the order statistics of k i.i.d. r.v.'s uniformly distributed on 
[0,1]. Moreover, the random vector in the left-hand side of (35) is indepen- 
dent of Sk+i- 

Therefore, if x?° lss (0) = -Sj are the initial positions of particles in the 
Poisson model, then for the initial positions of particles in the uniform model, 
we have xj|" lf (0) = ■ x^° lss (0). By Proposition 2 and (5), we conclude 
that 



(36) Tff = fc l T?™ s , /?„ := ^ ^±i, 

and hence, using (6), we get 

(37) K^(t) = K^ iss (p n t). 

Note that the process K^ mi (-) and the r.v. j3 n are independent since val- 
ues of the process are defined by x^ lf (0), . . . ,x^" lf (0), which are mutually 
independent of (5 n by Fact 5. 

Now we prove Theorem 1 for the uniform model. 

Proof of Theorem 1. Denote 

K Vnii (t) - na(t) 
Y n (t) : = " ^, Z n (t) := y/E(a(t) - a((3 n t)); 



we stress that Y n (-) and Z n {-) are independent. 

Fix an e G (0, 1). First, it follows from (2) for the Poisson model and (37) 
that 

(38) Yn (-) + Z n (-)^K Poiss (-) in £[0,1 -£]. 

Indeed, the process Y n (-) + Z n {-) is obtained from -^(K% oiss (-) - na(-)) by 

F 

the random time change 1 1— ► f3 n t] and since \\/3 n t — £||c[o,l-e] — * 0, we have 



by the definition of the Skorohod metric d. 

Second, from Fact 1, (15), and (27) it follows that a Unif (t) = a Poiss (t) 
jpjjT-oiss > t} = 1 — t 2 for < t < 1, and by the central limit theorem, 

(39) Z n {t) -^U t 2 i] in D[0,l-e), 
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where rj is a standard Gaussian r.v. 

We claim that (38), the independence of Y n (-) and Z n (-), and (39) yield 
the weak convergence of Y n {-) in D[0, 1 — e]. Let us check the tightness of 
Y n (-) and the convergence of their finite-dimensional distributions. 

The tightness of Y n (-) in D[0, 1 - e] follows from Y n (-) = (Y n {-) + Z„(-)) - 
Z n (-), (38), and (39). Indeed, by the Prokhorov theorem, (38) and (39) yield 
that both sequences Y n (-) + Z n (-) and —Z n (-) are tight. But trajectories 
of —Z n {-) are a.s. continuous because of the continuity of o(-), and the 
tightness follows from the continuity of addition +:DxC^D and the fact 
that under any continuous mapping, the image of a compact set is also a 
compact set. 

Now we study convergence of finite dimensional distributions of Y n (-). 
Recall that the characteristic function of a centered Gaussian vector in R m 
is e -1 / 2 (^ u > u ) ) where u G W 71 and R is the covariance matrix of the vector. 
Then (38), the independence of Y n (-) and Z n (-), and (39) yield that for the 
characteristic functions of all finite-dimensional distributions of Y n (-), we 
have 

(40) Ee i(Y„(t),u) , e -l/2({-R Poiss fe.*fe)-^*l}^=i«,") ) 

where u G W n , t = . . . ,t m ) G [0, 1-ep, and Y n (t) := (Y^h), . . .,Y n (t m )). 
We stress that (40) is true for every t G [0, 1 — e] m since the limit processes 
in (38) and (39) have continuous trajectories. 

We see that the matrix {R Poms (tj,tk) — tjtl)™ k=1 is positive definite for 
any t = (t%, . . . ,t m ) G [0, 1 — e] m and m > 1 since the absolute value of the 
left-hand side of (40) does not exceed one. Putting 

R lJnii ( s ,t):=R Poiss {s,t)-s 2 t 2 , 

we have {R^fa, t k ) - t 2 t 2 k }™ k=1 = {i? Unif (tj,tfc)}£ fc=1 ; then the function 
i? Umf (s,t) is positive definite on [0, l) 2 since e > is arbitrary. Thus, by 
Lifshits [15], Section 4, R Vnit (s,t) is the covariance function of some centered 
Gaussian process i^ Umf (-) on [0,1). 

Relation (2) is thus proved. Now check that K Unif (•) G C[0, 1 - e] a.s. to 
conclude the proof of Theorem 1 for the uniform model. 

For this purpose, let us prove that a.s., trajectories of Y n (-) have jumps 
of size only. In fact, the jumps of Y n (-) coincide with the jumps of 

-^K^ mi (•)> whose jumps are of size ^= if and only if T^™ f ^ for 

1 < h £ h < n - I. By (36), we need to verify that Tj^f / T Po ™ a.s. 
for 1 < ji ^ j2 < n — 1. This relation follows from (20) if H(ki,ji,h) / 
H(k2,j2,h) a -s. for j\ ji and &i,&2>^l>^2 > 1- The last a.s. nonequality is 
obvious because if the equality holds true, then a certain nontrivial linear 
combination of i.i.d. exponential equals zero. 
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Then there exist a.s. continuous Y n {-) such that sup t6 [ 0j i_ £ ] \Y n (t) — Y n (t)\ < 
-7= a.s.; consequently, d(Y n ,Y n ) < A= a.s. Then by Y n (-) K Vmi (-), we 

also have Y n (-) £T Unif (•)• But 1 = liminf P{Y n (-) G C} < P{i<C Unif (-) G C} 
since CcD is closed in the Skorohod topology, therefore, a.s., X Umf (-) is 
continuous on [0, 1 — e]. 

Since e G (0, 1) is arbitrary, a.s., i^ Umf (•) is continuous on the whole inter- 
val [0, 1). The R Vnii {s,t) = R Poiss (s,t) - s 2 t 2 is continuous on [0, l) 2 because 
R PoiaB (s,t) is. □ 

6. The number of clusters at the critical moment. Now we turn our 
attention to the number of clusters at the critical moment t = 1. We are 
interested in the behavior of 

K n (l)-na(l) _ K n (l) 

which is the left-hand side of (3) at t = 1; here we have a(l) = under 
EX 2 < oo, see Property 3, Section 3.4. 

We do not know if this sequence is weakly convergent, but we hope that 
it is. We also have a naive guess that its limit is Gaussian because the limit 
in Theorem 1 is Gaussian. In view of K n (l) > 1, this conjectured weak limit 
is nonnegative, hence it is Gaussian if and only if it is identically equal to 
zero. However, the results of this section show that the limit is nonzero, thus 
our guess on Gaussianity fails. 

The study of convergence of — is quite complicated. Therefore, in this 
section, we consider only the Poisson model. First, let us prove the following 
statement. 



Proposition 4. In the Poisson model, we have lim n _ >00 P{.R' n (l) = 
1} >0. 

Proof. On the one hand, K n {V) — 1 is equivalent to I^ a p ; ss < 1, where 
^rrPoiss denotes the moment of the last collision in the Poisson model. On 
the other hand, a result by Giraud [8] states that in the uniform model, 

v^Xif - 1) ^ SU P {-T— t W (f) dy- - Tw (y) dy) =: r, 

0<z<l \1 — X Jx X Jo J 

where W (•) is a Brownian bridge. Now, by (36), we have X^ nif = (3~ 1 X^ a p t oiss 
hence 

(41) VnGfl5r 1 2ftU--l)-^r. 
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But from the central limit theorem and the law of large numbers, 

(42) Vn((3 n -1 = -= — ► -, 

y/n y/S n+ i{y/S n+ i + y/n) 2 

where rj is a standard Gaussian r.v. and 5j is a standard exponential ran- 
dom walk that defines initial positions of particles. Since, in view of Fact 5, 
r n^Unif = Pn lT npoUss and &n are independent, from (41), (42), and the law 
of large numbers it follows that 

Vn(T n . Poiss -1) — > t ---t + -, 

where r and r\ are independent. Thus, 

hm ¥{K n (l) = 1} = Jim P{7^f* oiss < 1} = p{t + | < oj > 0. n 

The main advantage of the Poisson model is that, by Lemma 2 and Prop- 
erty 4, Section 3.4 we have ¥{Tj :Tl > 1} = epjp n -j, where 

p fc :=p| min £7$ - E#) > 1 
[i<m<fc. =1 J 

and S*j is a standard exponential random walk. We say that the sequence of 
r.v.'s Y^=\{Si — ESj) is an integrated random walk. In the proof of Property 
3, Section 3.4, we showed that pt — ► as k — > oo. Therefore, it is reasonable 
to say that pk are the unilateral small deviation probabilities of an integrated 
centered random walk. 

We need to obtain the asymptotics of p^ — > to continue the study of 
convergence 

of Unfortunately, the results of the rest of this section 

are completely dependent on the correctness of the following conjecture. 

Conjecture 1. We have p/- ~ ci/c -1 / 4 as k^ oo for some c\ G (0, oo). 

Simulations show that the conjecture is true and c\ ~ 0.36. The weaker 
form pk x A; -1 / 4 of Conjecture 1 was proved by Sinai [22], but only for inte- 
grated symmetric Bernoulli random walks. It also interesting to note that, 
by McKean [19], the unilateral small deviation probabilities of an integrated 
Wiener process have the same order as T — > oo: 

(43) F^mh^J' W(u) du > -1 j ~ c 2 T" 1/4 

for some c 2 G (0, oo). The left-hand side of (43) is a unilateral small deviation 
probability since 

(44) pi min /* W(u) du > -1 1 = pj min /" W(u) du > -T~ 3/2 1. 

[0<s<TJq J lo<s<i,/o J 
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To be precise, McKean was interested in a more general problem, and 
some calculations are required to obtain (43) from his results. Therefore, 
we additionally refer to Isozaki and Watanabe [12] who state (43) explic- 
itly. 

By the results mentioned above, we also suppose that Conjecture 1 is 
true for other integrated centered random walks that satisfy some moment 
conditions. 

Now we are able to prove the following result on convergence of K ^ ■ 

Proposition 5. Suppose Conjecture 1 holds true. Then in the Poisson 
model, we have 

(45) KmE/^fi^Uca, S u P e(^^) 2 <oo 



n J n>l \ V n 

for some C3 G (0, 00); the sequence K ^ is tight and uniformly integrable; 

and the limit of any weakly converging subsequence of takes value zero 

with positive probability, but is not identically equal to zero. 

Numerical simulations show that g ^ is weakly convergent and that this 
convergence is quite fast. In Figure 1 we present the (empirical) distribution 
function of for n = 10, 000. Since the simulations performed for n = 

40,000 showed a hardly perceptible difference, this function seems to be a 
good candidate for the distribution function of the conjectured limit. 

Note that if we weaken Conjecture 1 to p^ x A; -1 / 4 , then Proposition 5 
still holds true with the only difference that M Kn xy >c 1. 

J s/n 

Proof of Proposition 5. We start with the convergence of the ex- 
pectation. On the one hand, by (6) and Lemma 2, 

and on the other hand, 

£ r V4 (n _ r x/4 = I £ f±\ (l--) — B(3/4, 3/4) 

as the integral sum of Beta function. Then it follows from Conjecture 1 and 
standard arguments that K K ^' converges to C3 := ecf B(3/4, 3/4) > 0. 
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12 3 4 

Fig. 1. The distribution function of for n — 10,000. 

Now we check the uniform boundedness of M( K ^^ ) 2 . By (6) it is sufficient 



to prove that 



(46) 



1 



n-l 



sup - J2 F i T ^ > T J> > < °°- 



Suppose % < j; then by using (8) and properties of Fkj,l(')> we get 



F{T itn >l,T j>n >l}=w{ min > 0, min F fejV (l)>o) 

|_l<fc<n— i l<K<n— j J 

KKi KKj 

<p( min J F fe)M (l)>0 ) min >ol, 
U<*<(j'-*)/ 2 l<fc<n-j ,J ' J 



KKi 



l<«<0'-»)/2 



where by (j — i)/2 we mean \{j — i)/2\. The minima in the last expres- 
sion are independent as functions of {X m } m <( i+j y 2 and {X m } rn >( i+j y 2 +i, 
respectively; hence 

P{T,,„>l,T 3in >l}<p( min %(1)>01p{ min . F JM ,,(1)>0 

l.l<fc<(j— 1)/2 J l l<k<n—j 



KKi 



l<l<U-i)/2 
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= ^{ T i,i+(j-i)/2 > 1} • P{?( J -_ i )/2 )n _j + (j_ i )/2 > 1} 
2 2 

= e PiP[(j-i)/2]^-j' 

where the first equality follows from (8) and the second follows from Lemma 2. 
Recalling Conjecture 1, we get 

2 71—1 ^ 71—1 

- E ff ) {7i,„>l,T jV „>l}<- E eV^-ii/^Pn-i 
j,i=i,i^j *j=i,«^j 

71-1 

c 



<- E ^- 1/4 ni-^i/2r 1/2 (--jr 1/4 



n . 

M=1,»7=J 

71—1 / ■ 

<4 E f 1 

M=1,»7=J 



-1/4 



1 _ 1 

n n 



1/2 / \ -1/4 

1 - - 

n 



for some c > 0. The last expression is an integral sum converging to 

c f 1 f 1 x~ 1/A \x-y\- l/2 (l- y y 1/4 dxdy, 
Jo Jo 

and it is a simple exercise to check that the integral is finite. This concludes 
(46). 

The uniform integrability of K ™j£ follows from the second relation from 

(45), see Billingsley [3], Section 5, and the tightness follows from the uniform 
integrability. 

Finally, suppose Kn j— - £ for some subsequence rii — > oo and some 



r.v. £. Then E£ = C3 > by the uniform integrability and (45), and hence 
£ is not identically equal to zero. But the distribution of £ has an atom at 
zero since by Proposition 4 and properties of weak convergence, 

= 0} = lim P{£ < e} 

e\0 

> lim lim sup p( < e 

e\0 i^oo I V™i 



> lim lim ¥{K ni (1) = 1} > 0. n 

7. Open questions. 1. The number of clusters at the critical moment 
t=l. 

Here the main question is if Conjecture 1 holds true. Even by itself, this 
problem is worth studying. 

But even if Conjecture 1 is true, we still do not have a proof of weak 
convergence of — it is only known that this sequence is tight. The author 
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strongly believes, relying on numerical simulations, that the limit exists. It 
would be interesting to find this conjectured limit, which should be nontrivial 
by Proposition 5, in an explicit form. 

2. The weak convergence of j^zn^zggO on ^ e whole interval [0, 1]. 

It is very natural to ask if it is possible to strengthen Theorem 1 by 
proving the weak convergence of A "" ( ^- na( ' ) in 25 [0,1]. This complicated 
problem returns us again to Question 1 because the weak convergence of 
^(•)^_ nffl (-) [ n ]j[q^ implies the weak convergence of Kn ( 1 )~^ ia ( 1 ) = , see 

Billingsley [3], Section 15. But even if converges, its weak limit K(l) 

is not Gaussian, hence the limit process K(-), which is Gaussian on [0,1), 
is no more Gaussian on [0,1]. Therefore, it is doubtful that Theorem 1 is 
true in D [0, 1]; at least, one should provide a proof completely different from 
the presented one. Also, it is unclear how to define the finite-dimensional 
distributions of the non-Gaussian K{-) on [0, 1] because simulations show 
that K{1) would not be independent with K(t) for t < 1. 

3. The number of clusters in the warm gas. 

In the presented case, initial speeds of particles are zero. This model is of- 
ten called the cold gas according to its zero initial temperature. We introduce 
a new model stating that initial speeds of particles are a n vi,a n V2, ■ ■ ■ ,a n v n , 
where Vi are some i.i.d. r.v.'s and a n is a sequence of normalization con- 
stants. This model, called the warm gas, was studied in many papers, for 
example, [14, 16, 20, 25]. 

It is of a great interest to study the behavior of K n (t) in the warm gas. 
In [25], the author proved that in the basic case where a n = 1 for all n and 

Kvf < oo, we have Kn W — 5-U for all t > 0. The question is to find a normal- 
ization of K n (t) leading to some nontrivial limit. Clearly, this normalization 
depends on a n , but it is very possible that there is an effect of phase tran- 
sition similar to the one discovered by Lifshits and Shi [16]: If a n are small 
enough, then the gas has a low temperature and the normalization is the 
same as in the cold gas. If a n are big enough, as in the basic case a n = 1, 
then the normalization and the behavior of the gas differ entirely from the 
case of the cold gas. 

The author believes that the localization property, which is described in 
Section 3, could be helpful in a study of these questions. 

It is also interesting to compare the behavior of K n (l) in the warm and in 
the cold gases; in the warm gas, the moment t = 1 plays the same "critical" 
role as in the cold gas, see Lifshits and Shi [16]. The variable K n {\) was 
studied by Suidan [24] , who considered the warm gas with a n = 1 and deter- 
ministic initial positions of particles (his initial positions were -,-,...,-). 
For this case, Suidan found the distribution of K n (l) and showed that 
E2T n (l) ~ logn. Recall that in the presented case, E2f n (l) ~ cj,^fn. 
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4. The number of clusters in ballistic systems of sticky particles. 

A sticky particles model is called ballistic if it evolves according to the 
laws introduced in Section 1, but in the absence of gravitation. Such models 
are, in some sense, more natural than gravitational ones because the ba- 
sic assumption that gravitation does not depend on distance is sometimes 
confusing. However, an unpublished paper of Lifshits and Kuoza shows that 
certain gravitational and ballistic models are tightly connected. 

It seems interesting to study the number of clusters in the ballistic model. 
The author does not know any results in this field. 
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